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(57) Abstract 

The present invention provides a mammalian cell in which expression of a gene coding for a serine protease, especially hepsin, has 
been functionnally interrupted or suppressed. In a preferred embodiment, suppression of the serine protease gene, e.g., a hepsin gene, is 
achieved by, the insertion into it, a nucleotide sequence effective to disrupt its expression. The invention especially relates to a mammal 
containing one or more cells in which expression of the serine protease gene is functionally inactivated or interrupted. 
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KNOCKOUT MICE AND THEIR PROGENY WITH 
DISRUPTED SERINE PROTEASE GENES 

BACKGROUND OF THE INVENTION 

5 

Proteases participate in a variety of developmental and metabolic processes 
(Stroud R, ScL Am. 231:74-88 (1974) and Neurath H, Science 224:350-357 (1984). 
Molecular defects that alter enzyme function often lead to serious human diseases, 
such as bleeding, thrombosis, and atherosclerosis. Hepsin is a novel serine protease 

io of the trypsin family and contains a transmembrane domain near its amino-terminus, 

Kurachi et aL, Methods in Enzymology, 244:100-114 (1994). This structural feature 
distinguishes hepsin from most other serine proteases. In the fruitfly Drosophiia 
rieianc lister, another transmembrane serine protease, Stubble-s^oblcid, has been 
reported V.at has similar structural features as that of hepsin. Stubble-stubbloid protein 

15 plays an important role in epithelial morphogenesis and development in Drosophiia. 

Defects in Stubble-stubbloid gene cause malformation of legs, wings and bristles, Appel 
et aL, Proc. Natl. Acad. Sci. USA 90:4937-4941 (1993). 

The human hepsin cDNA was initially isolated from a liver cDNA library 
screened with a mixture of oligonucleotides based on a consensus sequence of serine 

20 proteases, Leytus et al., Biochemistry 27:1067-1074 (1988). Biochemical studies 

indicate that hepsin is a type II transmembrane serine protease expressed mainly on 
the surface of hepatocytes, Tsugi et al., J. Biol. Chem. 266:16948-16953 (1991). Lower 
levels of hepsin mRNA are detected in other tissues including lung, kidney, pancreas, 
stomach, thyroid and prostate. In addition, hepsin mRNA is present in several human 

25 tumor cell lines, such as hepatoma cells HepG2 and PLC/PRF/5, mammary cancer 

cells MCF784 and T470, and epitheloid carcinoma cells HeLa S3, Tsuji et aL, J. Biol. 
Chem. 266:16948-16953 (1991) and Torres-Rosado et aL, Proc. Natl. Acad. Sci. USA 
90:7181-7185 (1993). 

Hepsin has a number of reported activities. In an in vitro study, recombinant 

30 human hepsin expressed on the cell surface activated blood coagulation factor VII but 

not factors IX, X, prothrombin or protein C, all of which share significant structural and 
sequence similarities with factor VII. The activation of factor VII by hepsin was shown 
to be sufficient to initiate the coagulation pathway leading to thrombin formation, 
Kazama et aL, J. Biol. Chem. 270:66-72 (1995). Elevated plasma factor Vila activity 
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has been known to be a significant risk factor for ischemic heart disease and 
cardiovascular death, Hultin, Prog. Hemostasis Thrombosis 10:215-241, (1991) and 
Mann, Arteriosclerosis 9:783-784 (1989). Factor Vila/tissue factor complex also 
contributes to tumor-related hypercoagulability and intravascular thrombosis, Edwards 

5 et at., Thromb. Haemostasis 69:205-213 (1993). 

In addition to blood coagulation, hepsin was reported to be critical for cell 
growth. In a cell culture system, addition of anti-hepsin antibodies or hepsin-specific 
antisense oligonucleotides to the culture medium significantly inhibited growth of 
hepatoma cells , Torres-Rosado et al. f Proc. Natl. Acad. Set. USA, 90:7181-7185 

10 (1993). This observation is quite interesting in light of the expression of hepsin mRNA 

in a number of tumor cells. The growth factor-like activities of serine proteases have 
been known for many years. For example, thrombin is a potent mitogen for vascular 
fibroblasts and smooth muscle cells, Fenton J, Ann. N.Y. Acad. ScL t 485:5- 1 5 1 1986). 
Furthermore, serine proteases also participate in processing of growth factors, 

15 Massague J, J. Biol. Chem., 265:21393-21396 (1990). The hepsin-dependent tumor 

cell growth indicates a mechanism in which hepsin functions either directly as a growth 
factor or indirectly as an enzyme that processes certain growth factors essential for cell 
growth. 

20 DESCRIPTION OF THE DRAWINGS 

Fig. 1 (A-C) shows a wild-type mouse hepsin gene (A), a targeting vector (B) 
designed to target the mouse hepsin gene, and a modified mouse hepsin gene (C) 
which has been modified by the targeting vector. 

Fig. 2 shows a full-length cDNA and amino acid coding sequence for a mouse 
25 hepsin gene (Seq. ID No. 1 and Seq. ID No, 2), 

Fig. 3. Serum alkaline phosphatase levels in hepsin-deficient and control mice. 
Serum samples were prepared from male (upper panel) and female (lower panel) 
hepsin-/- mice and wild-type litter mates at 10, 12 and 14-16 weeks of age. The 
number of mice in each group is 4 to 5. Total serum alkaline phosphatase activity was 
30 measured by hydrolysis of p-nitrophenylphosphate at pH 9.2, McComb, R.B., Bowers, 

Jr., G.N., Posen, S. in Alkaline Phosphatase, Plesum Press, New York, 1979. Data are 
presented as mean ± SD. Statistical differences between hepsin-/- mice (open bars) 
and wild-type litter mates (filled bars) are indicated by asterisks, *P < 0.05, **P < 0.01 
by Student's t test. 
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Fig. 4. Analysis of alkaline phosphatase isoenzymes-. Serum samples were 
collected from 14-16 week-old female mice. Total serum alkaline phosphatase activity 
was assayed by hydrolysis of p-nitrophenylphosphate at pH 9.2, McComb, R.B., 
Bowers, Jr., G.N., Posen, S. in Alkaline Phosphatase, Plenum Press, New York, 1979. 
Determination of activity from isoenzymes was based on a quantitative assay described 
by Hoffmann W.E., Everds, N., Pignatello, M. t and Solter, P.F., Toxicol. PathoL, 22:633- 
638, 1994. Data are presented as mean ± SD from 8 wild-type (filled bars) and 14 
hepsin-/- (open bars) mice. Statistical differences are indicated by asterisks, **P < 0.01 
by Student's t test. 

DESCRIPTION OF THE INVENTION 

The present invention relates to a serine protease, especially hepsin, and a 
nucleic acid coding for it. In one aspect, the invention provides a mammalian cell m 
which expression of a gene coding for a serine protease, especially hepsin, has been 
"knocked out" or disrupted. In a preferred embodiment, disruption of the serine 
protease gene, e.g., a hepsin gene, is achieved by the insertion of a nucleotide 
sequence into it which is effective to suppress the gene's expression. 

The invention especially relates to a mammal containing one or more cells in 
which expression of the serine protease gene is functionally inactivated or disrupted. 
Functional inactivation or disruption refers, e.g., to a partial or complete reduction of the 
expression of at least a portion of a polypeptide encoded by an endogenous serine 
protease gene of a single cell, selected cells, or all of the cells of a mammal. The term 
"knockout" is a synonym for functional inactivation of the gene. 

In one embodiment, a gene targeting strategy is utilized that facilitates the 
introduction of a desired nucleotide sequence into a serine protease gene, preferably a 
hepsin gene. The gene targeting strategy preferably utilizes double reciprocal 
recombination and a positive selectable marker to assist in the insertion of the 
nucleotide sequence into a target nucleic acid. The target nucleic acid is preferably a 
gene, more preferably a gene at its particular chromosomal locus. The desired 
nucleotide sequence is inserted into the gene in such a way that the gene is functionally 
disrupted, i.e., its expression is partially or completely reduced. 

In one aspect of the invention, a targeting vector is employed to insert a 
selectable marker into a predefined position of a gene, preferably a serine protease 
such as hepsin. The position is selected to achieve functional disruption of the gene 
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upon insertion of the selectable marker. For such purposes, a preferred embodiment is 
a recombinant nucleic acid molecule comprising: (1) a 5' nucleotide sequence which is 
effective to achieve homologous recombination at a first predefined position of a 
mammalian serine protease gene, e.g., hepsin. operably linked to (2) the 5' terminus of 
a first selectable nucleotide sequence which confers a first selection characteristic on a 
cell in which it is present, and (3) a 3' nucleotide sequence which is effective to achieve 
homologous recombination at a second predefined position of the mammalian serine 
protease gene, e.g., hepsin gene, operably linked to the 3' terminus of the first 
selectable nucleotide sequence. The recombinant nucleic acid molecule is effective to 
achieve homologous recombination in a mammalian chromosome at predefined 
location. Fragments of the targeting vector are also within the scope of the invention, 
e.g., recombinant nucleic acid molecules comprising elements (1) and (2), or 
comprising elements (2) and (3), etc. 

The term recombinant refers, e.g., to a nucleic acid molecule which has been 
modified by the hand-of-man, e.g., comprising fragments of nucleic acid from different 
sources or a nucleic acid molecule from one source which has been engineered. Thus, 
the nucleic acid molecule is recombinant, e.g., because it comprises nucleotide 
sequences from a mammalian hepsin gene and selectable marker gene. A molecule is 
also recombinant when it contains sequences from the same gene but arranged in a 
manner that does not occur in nature, i.e., a non-naturally-occurring arrangement. 

Homologous recombination refers to the process in which nucleic acid 
molecules with similar genetic information line up side-by-side and exchange nucleotide 
strands. A nucleotide sequence of the recombinant nucleic acid which is effective to 
achieve homologous recombination at a predefined position of a target nucleic acid 
therefore indicates a nucleotide sequence which facilitates the exchange of nucleotide 
strands between the recombinant nucleic acid molecule at a defined position of a target 
gene, e.g., a mouse hepsin gene. The effective nucleotide sequence generally 
comprises a nucleotide sequence which is complementary to a desired target nucleic 
acid molecule (e.g., the gene locus to be modified), promoting nucleotide base pairing. 
Any nucleotide sequence can be employed as long as it facilitates homologous 
recombination at a specific and selected position along the target nucleic acid molecule. 
Generally, there is an exponential dependence of targeting efficiency on the extent or 
length of homology between the targeting vector and the target locus. Selection and 
use of sequences effective for homologous recombination is described, e.g., in Deng 
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and Capecchi, Mol. Cell. Bio., 12:3365-3371, 1992; Bollag etaL, Annu. Rev. Genet, 
23:199-225, 1989; Waldman and Liskay, Mol. Cell. Bio., 8:5350-5357, 1988. 

An aspect of the present invention is to suppress or functionally disrupt 
expression of a serine protease gene. The phrases "disruption of the gene", "gene 

5 disruption," "suppressing expression," "gene suppression," "functional inactivation of 

the gene," or "functional gene inactivation" refer to modification of the gene in manner 
which decreases or prevents expression of that gene and/or its product in a ceil. The 
expression of the gene's product can be completely or only partially suppressed, e.g., 
reduced by 70%, 80%, 85%, 90% t 95%, 99%, or more. A functionally disrupted gene, 

10 e.g., a functionally disrupted hepsin gene, includes a modified gene which expresses a 

truncated polypeptide having less than the entire coding sequence of the wild-type 
gene. Such a gene is illustrated in the examples below. See, e.g., Fig. 1. A gene can 
also be functionally disrupted by affecting its mRNA structure in such a way to create 
an untranslatable message, e.g., frame-shift, decreased stability, etc. 

15 In accordance with the present invention, a serine protease gene is modified in 

such a manner which is effective to disrupt expression of the corresponding gene 
product. Thus, e.g., a functionally disrupted recombinant hepsin gene does not express 
a functional hepsin polypeptide or expresses a functional hepsin polypeptide at levels 
which are less than wild-type levels of hepsin, e.g.,-, reduced by 70%, 80%, 85%, 90%, 

20 95%, 99%, or more. By "not functional" or "functionally inactive" hepsin polypeptide; it 

is meant, e.g., that the hepsin is not enzymatically operational. See, Background for a 
description of hepsin enzymatic activity. The gene can be modified in any effective 
position, e.g., enhancers, promoters, regulatory regions, noncoding sequences, coding 
sequences, introns, exons, etc., so as to decrease or prevent expression of that gene in 

25 the cell. Insertion into a region of a serine protease gene, e.g., a murine hepsin gene, is 

usually accomplished by homologous recombination. A recombinant nucleic acid 
molecule comprising regions of gene homology and a nucleotide sequence coding for a 
selectable marker gene is inserted into the promoter and/or coding region and/or 
noncoding regions of a serine protease, whereby expression of the gene is functionally 

30 disrupted. When this knockout construct is then inserted into a cell, the construct can 

integrate into the genomic DNA. Thus, progeny of the cell will only express only one 
functional copy of the gene; the other copy will no longer express the gene product, or 
will express it at a decreased level, as the endogenous nucleotide sequence of the 
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gene is now disrupted by the inserted nucleotide sequence. If desired the functional 
gene can be inactivated in a second analogous step. 

The nucleotide sequence effective for homologous recombination is operably 
linked to a nucleotide sequence, preferably a selectable marker nucleotide sequence or 
5 gene, which is to be inserted into the desired target nucleic acid. For example, an 

aspect of the present invention is to replace all or part of the nucleotide coding 
sequence for a hepsin polypeptide of the mouse hepsin gene with a nucleotide 
sequence for a selectable marker. In the examples, exons 5-8 of the mouse hepsin 
gene are replaced by a gene coding for neomycin resistance. Its introduction into the 

10 gene is achieved by attaching a part of the mouse hepsin gene comprising exon 4 to 

the 5' terminus of a neomycin resistance gene and a part of the mouse hepsin gene 
comprising exons 9-12 to the 3' terminus of the selectable marker to form a targeting 
vector. The mouse hepsin gene segments are positioned with respect to the selectable 
marker gene in a way such that homologous recombination between them and the 

15 mouse gene will result in replacement of mouse exons 5-8 with the selectable marker 

gene. Such positioning, i.e., operable linkage, means that the mouse gene segment is 
joined to the selectable marker gene whereby the homologous recombination function 
can be accomplished. A recombinant nucleic acid molecule of the present invention is 
effective for disrupting expression of a mammalian gene, preferably a mammalian 

20 hepsin gene, such as a murine hepsin gene. This means, e.g., that the molecule 

comprises sequences effective for homologous recombination into the desired nucleic 
acid target and that modification of the target by the effective molecule results in a 
modified target in which expression of the gene product is disrupted. Fig. 1 shows an 
example of a targeting vector which is effective for disrupting expression of a murine 

25 hepsin gene. 

The recombinant nucleic acid is preferably inserted into a cell with chromosomal 
DNA that contains the endogenous gene to be knocked out. In the cell, the recombinant 
nucleic acid molecule can integrate by homologous recombination with the DNA of the 
cell in such a position so as to prevent or interrupt transcription of the gene to be 

30 knocked out. Such insertion usually occurs by homologous recombination (i.e., regions 

of the targeting vector that are homologous or complimentary to endogenous DNA 
sequences hybridize to each other when the targeting vector is inserted into the cell; 
these regions can then recombine so that part of the targeting vector is incorporated 
into the corresponding position of the endogenous DNA). 
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As discussed, one or more nucleotide sequences can be inserted into a serine 
protease gene to suppress its expression. It is desirable to detect the presence of the 
nucleotide sequence in the gene. Such detection can be accomplished in various 
ways, including by nucleic acid hybridization, antibody binding to an epitope encoded by 

5 the inserted nucleic acid, or by selection for a phenotype of the inserted sequence. 

Accordingly, such an inserted nucleotide sequence can be referred to as a first 
selectable nucleotide sequence. A first selectable nucleotide sequence preferably 
confers a first selection characteristic on a cell in which it is present By the phrase 
"selection characteristic," it is meant, e.g., a characteristic which is expressed in a cell 

10 and which can be chosen in preference to another or other characteristics. The 

selectable nucleotide sequence, also known as selectable marker gene, can be any 
nucleic acid molecule that is detectable and/or assayable after it has been incorporated 
into the genomic DNA of the mammal. The selection characteristic can be a positive 
characteristic, i.e., a characteristic which is expressed or acquired by cells and whose 

15 presence enables selection of such cells. A positive selection characteristic can enable 

survival of the cell or organism, e.g., antibiotic resistance, ouabain-resistance (a gene 
for a ouabain-resistant sodium/potassium ATPase protein). Examples of positive 
selection characteristics and a corresponding selection agent include, e.g., Neo and 
G418 or kanomycin; Hyg and hygromycln, hisD and histidinol; Gpt and xanthine; Ble 

20 and bleomycin; Hprt and hypoxanthine. See, e.g., USP No. 5,464,764 and Capecchi, 

Science, 244:1288-1292, 1989. The presence of the selectable gene in the targeted 
sequence can also be identified by using binding ligands which recognize a product of 
the selectable gene, e.g., an antibody can be used to identify a polypeptide product 
coded for by the selectable gene, an appropriate ligand can be used to identify 

25 expression of a receptor polypeptide coded for by the selectable gene, or by assaying 

for expression of an enzyme coded for by the selectable gene. Preferably, the 
selectable marker gene encodes a polypeptide that does not naturally occur in the 
mammal. 

The selectable marker gene can be operably linked to its own promoter or to 
30 another promoter from any source that will be active or can easily be activated in the 

cell into which it is inserted. However, the selectable marker gene need not have its 
own promoter attached, as it may be transcribed using the promoter of the gene into 
which it is inserted. The selectable marker gene can comprise one or more sequences 
to drive and/or assist in its expression, including, e.g., ribosome-recognition sequences, 
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enhancer sequences, sequences that confer stability to the polypeptide or RNA, and/or 
a polyA sequence attached to its 3' end to terminate transcription of the gene. 

A positive selectable marker facilitates selection for recombinants in which the 
positive selectable marker has integrated into the target nucleic acid by homologous 
recombination. A gene targeting vector in accordance with the present invention can 
also further comprise a second selection characteristic coded for by a second 
selectable gene to further assist in the selection of correctly targeted recombinants. A 
negative selection marker permits selection against cells in which only non-homologous 
recombination has occurred. In one preferred embodiment the second selectable 
marker gene confers a negative selection characteristic upon a cell in which it has been 
introduced. Such negative selection characteristics can be arranged in the targeting 
vector in such a way that it can be utilized to discriminate between random integration 
events and homologous recombination. By the term negative selection, it is meant a 
selection characteristic which, when acquired by the cell, results in its loss of viability 
(i.e., it is lethal to the cell). A nucleoside analog, gancyclovir, which is preferentially 
toxic to cells expressing HSV tk (herpes simplex virus thymidine kinase), can be used 
as a negative selection agent, as it selects for cells which do not have an integrated 
HSV tk selectable marker. FIAU (1,2-deoxy-2-fluoro- -d-arabinofuransyi-5-iodouracil) 
can also be used as a selection agents to select for cells lacking HSV tk. Other 
negative selectable markers can be used analogously. Examples of negative selection 
characteristics and a corresponding thymidine kinase (HSV tk) and acyclovir, 
gancyclovir, or FIAU; Hprt and 6-thioguanine or 6-thioxanthine; diphtheria toxin; ricin 
toxin; cytosine deaminase and 5-fluorocytosine. 

The negative selectable marker is typically arranged on the gene targeting 
vector 5' or 3* to the recombinogenic homology regions so that double-crossover 
replacement recombination of the homology regions transfers the positive selectable 
marker to a predefined location on the target nucleic acid but does not transfer the 
negative selectable marker. For example, a tk cassette can be located at the 3' end of 
a murine hepsin gene, about 150 base pairs from the 3* stop codon. More than one 
negative selectable marker can also be utilized in a targeting vector. The positioning of, 
for example, two negative selection vectors at the 5* and 3' ends of a targeting vector 
further enhances selection against target cells which have randomly integrated the 
vector. Random integration sometimes results in the rearrangement of the vector, 
resulting- in excision of all or part of the negative selectable marker prior to random 
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integration. When this occurs, negative selection cannot be used to eliminate those 
cells which have incorporated the targeting vector but by random integration rather than 
homologous recombination. The use of more than one negative selectable marker 
substantially enhances the likelihood that random integration will result in the insertion 
of at least one of the negative selectable markers. For such purposes, the negative 
selectable markers can be the same or different 

The use of a positive-negative selection scheme reduces the background of 
cells having incorrectly integrated targeted construct sequences. Positive-negative 
selection typically involves the use of two active selectable markers: (1) a positive 
selectable marker (e.g., neo) that can be stably expressed following random integration 
or homologous targeting, and (2) a negative selectable marker (e.g., tk) that can only 
be stably expressed following random integration. By combining both positive and 
negative selection, host cells having the correctly targeted homologous recombination 
event can be efficiently obtained. Positive-negative selection schemes are described, 
e.g., in USP No. 5,464,764; WO 94/06908. It is recognized, however, that one or more 
negative selectable markers are not required to carry out the present invention, e.g., 
produce a transgenic animal in which a serine protease gene is functionally inactivated 
or disrupted. 

A recombinant nucleic acid molecule according to the present invention can also 
comprise all or part of a vector. A vector is, e.g., a nucleic acid molecule which can 
replicate autonomously in a host cell, e.g., containing an origin of replication. • Vectors 
can be useful to perform manipulations, to propagate, and/or obtain large quantities of 
the recombinant molecule in a desired host. A skilled worker can select a vector 
depending on the purpose desired, e.g., to propagate the recombinant molecule in 
bacteria, yeast, insect, or mammalian cells. The following vectors are provided by way 
of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pBS, pD10, Phagescript, 
phiX174, pBK Phagemid, pNH8A, pNH16a, pNH18Z, pNH46A (Stratagene); Bluescript 
KS+II (Stratagene); ptrc99a, pKK223-3, pKK233-3 t pDR540, pRITS (Pharmacia). 
Eukaryotic: pWLNEO, pSV2CAT, pOG44, pXTl, pSG (Stratagene), pSVK3, pBPV, 
pMSG, pSVL (Pharmacia). However, any other vector, e.g., plasmids, viruses, or parts 
thereof, may be used as long as they are replicable and viable in the desired host. The 
vector can also comprise sequences which enable it to replicate in the host whose 
genome. is to be modified. The use of such vector can expand the interaction period 
during which recombination can occur, increasing the targeting efficiency. An example 
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of a gene targeting vector that can be used in accordance with the present invention is 
described in Molecular Biology, ed. by F.M. Ausubel et a!., Unit 9.16, Fig. 9.16.1 
(pNTK). 

In accordance with an aspect of the present invention, the function of a serine 
5 protease gene, such as a hepsin gene, is disrupted or knocked out by the insertion of 

an exogenous or heterologous sequence into it, interrupting its function. For example, 
the exogenous or heterologous sequence can be inserted into a region of the hepsin 
gene before its first start codon. The nucleotide sequence coding for a selectable 
characteristic can be inserted into the hepsin gene in such a manner by homologous 
10 recombination so that it is operably linked to the endogenous promoter of the hepsin 

gene. Upon integration of the selectable marker gene into the desired predefined 
position of the hepsin gene, expression of the selectable characteristic is driven by the 
endogenous hepsin gene promoter, permitting its detection into those cells into which :: 
has integrated. 

15 Th e selectable marker gene can also be integrated at positions downstream of 

3* to the first start codon of the hepsin gene. The hepsin gene can be integrated out-of- 
reading frame or in-reading frame with the hepsin polypeptide so that a fusion 
polypeptide is made, where the fusion polypeptide is less active than the normal 
product. By detecting only those cells which express the characteristic, cells can be 

20 selected which contain the integrated sequence at the desired location. A convenient 

way of carrying out such selection is using antibiotic resistance. In the examples below, 
neomycin resistance is utilized as the selectable characteristic. Cells grown in the 
presence of a toxic concentration of neomycin will normally die. Acquisition of the 
neomycin resistance gene by homologous recombination rescues cells from the lethal 

25 effect, thereby facilitating their selection. 

The serine protease gene is knocked out or functionally interrupted by the 
integration event. The insertion of the selectable gene ahead of the hepsin coding 
sequence effectively isolates it from a promoter sequence, disabling its expression. If 
the selectable gene contains a transcription terminator, then transcription of the gene 

30 using the hepsin promoter will terminate immediately after it and will rarely result in the 

transcription of the hepsin coding sequence. The serine protease gene can also be 
knocked out by a deletion without a replacement, such as a site-directed deletion of a 
part of the gene. Deleted regions can be coding regions of regulating regions of the 
gene. 
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A hepsin gene can be modified at any desired position. It can be modified so 
that a truncated hepsin polypeptide is produced having one or more activities of the 
complete hepsin polypeptide. As already discussed, such a modified gene is a 
functionally disrupted gene. In the examples below, exons 5-8 of a murine hepsin gene 
are replaced by a neomycin resistance. Expression of the modified gene could result in 
a truncated polypeptide comprising exons 1-4 of hepsin. This polypeptide would 
comprise a transmembrane domain from amino acids 18-44; See, Leytus et al. v 
Biochemistry, 27:1067-1074 (1988), but lack the catalytic domains and therefore would 
be functionally inactive. See Background for a description of hepsin activity. It is 
believed that such a truncated polypeptide would not be produced at all. The truncated 
mRNA would lack its 3' end which normally confers stability to the message. 
Consequently, the resultant mRNA would be unstable and therefore rarely, if at all, 
available for translation. Such truncated mRNA was not detected in RNA blots. The 
hepsin gene can be functionally interrupted so that a resultant product lacks the . 
transmembrane domain completely or partially and/or comprises all or part of the 
catalytic domains, etc., as desired, and therefore lacks all or part of its activity, 
especially its enzymatic activity. 

If desired, the insertion(s) can be removed from the recombinant gene. In the 
example, a neomycin cassette replaced exons of the mouse hepsin gene to functionally 
inactivate it. The neomycin cassette can be subsequently removed from the hepsin 
gene, e.g., using a recombinase system. The Cre-lox site specific recombination 
system is especially useful for removing sequences from a recombinant gene. To 
utilize the Cre-lox system, recombinase recognition sites are integrated into the 
chromosome along with the selectable gene to facilitate its removal at a subsequent 
time. For guidance on recombinase excision systems, see, e.g., USP Nos. 5,626,159, 
5,527,695, and 5,434,066. See also, Orban, P.C., et al., "Tissue-and Site-Specific DNA 
Recombination in Transgenic Mice", Proc. Natl. Acad. Sci. USA, 89:6861-6865 (1992); 
O'Gorman, S., et al., "Recombinase-Mediated Gene Activation and Site-Specific 
Integration in Mammalian Cells", Science, 251:1351-1355 (1991); Sauer, B., et al., 
"Cre-stimulated recombination at loxP-Containing DNA sequences placed into the 
mammalian genome", Nucleic Acids Research, 17(1):147-161 (1989). 

In the example, after excision of the neo gene, the splicing machinery could 
bypass the remaining part of exon 9, connecting the 3' end of exon 8 to the 5' end of 
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exon 13. Thus, even if a correct reading frame were restored, the encoded protein still 
would lack the active enzymatic site. 

A nucleic acid comprising a nucleotide sequence coding without interruption 
means that the nucleotide sequence contains an amino acid coding sequence for a 
5 polypeptide, with no non-coding nucleotides interrupting or intervening in the coding 

sequence, e.g., absent intron(s) or the noncoding sequence, as in a cDNA. 

The present invention also relates to an isolated mouse hepsin or biologically- 
active fragments thereof. The term "isolated" means that the material is in a form in 
which it is not found in its original environment, e.g., it is more concentrated, more 

10 purified, or separated from components, etc. The fragments are preferably biologically- 

active. By biologically-active, it is meant that the polypeptide fragment possesses an 
activity in a living system or with components of a living system. Biological-activities of 
hepsin or a fragment thereof include: immunogenic activity, targeting to the cell 
membrane, insertion in the cell membrane (e.g., where the fragment comprises the N- 

15 terminal transmembrane domain), cell signaling (e.g., in morphogenesis or 

development, e.g., of bone or cartilage), activation of blood coagulation factor VII, 
tumor-related hypercoagulability, hydrolysis of synthetic substrates, and/or growth 
factor activity. For these and other hepsin activities, see also the Background section 
above and the references discussed therein. A hepsin fragment can contain one or 

20 more activities. Immunogenic activity means that the polypeptide binds to hepsin 

specific antibodies or is capable of eliciting an immune response specific for a hepsin. 
A hepsin polypeptide, or corresponding nucleic acid coding for it, means a polypeptide 
which can be isolated from a natural source. It therefore includes naturally-occurring 
normal and mutant alleles. Natural sources include, e.g., living cells obtained from 

25 tissues and whole organisms, and cultured cell lines. Fragments can be prepared 

routinely, e.g., by recombinant means or by proteolytic cleavage of isolated 
polypeptides, and then assayed for a desired activity. 

Another aspect of the present invention relates to host cells comprising a 
recombinant nucleic acid of the invention. A cell into which a nucleic acid is introduced 

30 is a transformed cell. Host cells include, mammalian cells, e.g., murine Ltk-, murine 

embryonic stem cells, COS-7, CHO, HeLa, insect cells, such as Sf9 and Drosophila, 
bacteria, such as E. coli, Streptococcus, bacillus, yeast, fungal cells, plants, embryonic 
stem cells (e.g., mammalian, such as mouse or human), neuronal cells (primary or 
immortalized), e.g., NT-2, NT-2N, PC-12, SY-5Y, neuroblastoma. See, also Methods in 
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Enzymology, Volume 185, e<±, D.V. Goeddel. A nucleic acid can be introduced into the 
cell by any effective method including, e.g., calcium phosphate precipitation, 
electroporation, injection, pressure, DEAE-Dextran mediated transfection, fusion with 
liposomes, and viral transfection. When the recombinant nucleic acid is present in a 
mouse cell, it is preferably integrated by homologous recombination into the mouse cell 
gene locus. 

A transformed cell can contain a recombinant gene integrated into its 
chromosome at the targeted gene locus. A targeting vector which comprises 
sequences effective for homologous recombination at a particular gene locus, when 
introduced into a cell under appropriate conditions, will recombine with the homologous 
sequences at the gene locus, introducing a desired selectable gene into it. When 
recombination occurs such that insertion results, the nucleic acid is integrated into the 
gene locus. The gene locus can be the chromosomal locus which is characteristic of 
the species, or it can be a different locus, e.g., translocated to a different chromosomal 
position, on a supernumerary chromosome, on an engineered "chromosome," etc. 

As discussed below, the present invention also relates to transgenic animals- 
containing one or more modified serine protease genes. The transgenic animals 
produced in accordance with the present invention can be used as a source to establish 
primary or established, e.g., immortalized, cell lines according to. various methods as 
the skilled worker would know. Since the animals (either, homozygotes or 
heterozygotes) contain a modified serine protease gene, e.g., hepsin, the 
corresponding cell lines would be expected to have the same genotype. The cell lines 
can be derived from any desired tissue or cell-type, including, e.g., liver, epithelia, 
neuron, fibroblast, mammary, lung, kidney, pancreas, stomach, thyroid, prostate, 
osteoblasts, osteoclasts, osteocytes, osteoprogenitor cells, muscle (e.g., smooth), etc. 

Cell lines produced in accordance with the present invention are useful for a 
variety of purposes. In one aspect of the invention, it is desirable to create panels of 
cell lines which differ in the expression of one or more genes. For example, the present 
invention describes and enables the production of cell lines which lack a serine 
protease gene, such as the hepsin gene. A hepsin-functionally-disrupted cell line 
differs from the parental (i.e., starting) cell line by the expression of the hepsin gene. 
The availability of such pairs of cell lines, i.e., plus or minus for hepsin expression (or 
any other desired gene or genes), is useful to distinguish the effects of hepsin from 
those of other genes products, such as other serine proteases. Hepatocyte lines, such 
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. as HepG2 or PLC/PRF/5 cells, express a variety of different proteases, including, 
thrombin, factors Xa, IXa, and Vila, protein C and hepsin. A cell line functionally- 
disrupted in one or more desired proteases (e.g., hepsin), in combination with the 
parental cell line intact for the desired protease(s), can be employed to specifically 
5 distinguish its activity (e.g., hepsin) from all other proteases. Such genetic dissection 

can be used to develop, e.g., drugs and therapeutics which target a specific gene 
product, e.g., hepsin, thrombin, factors Xa, IX, and Vila, and protein C, the latter 
proteases which play roles in cardiovascular and blood diseases. For example, to 
prevent clots, it is desirable to develop reagents which inhibit a specific protease in a 

10 cellular system. Inhibition of other proteases involved in other biological processes is 

undesirable since it can lead to secondary and unwanted side effects. A protease 
inhibition assay using cells or proteases obtained from, e.g., hepatocytes, would be 
unable to distinguish between the different proteases (i.e., enzymatic activities) 
expressed by the same cells. By using cell lines in which one or more proteases are 

15 genetically knocked out, the effects of such inhibitors can be identified more specifically 

(e.g., an inhibitor which inhibits protease activity equally in the hepsin-functionally 
disrupted cell line and its parent would be considered to be a non-hepsin-specific 
protease inhibitor). For hepsin, such pairs (or more) of cell lines are useful in identifying 
agents to treat pathological conditions in which hepsin activity plays a role, e.g., 

20 ischemic heart disease, cardiovascular deficiency, tumor-related hypercoagulation, 

intravascular thrombosis, cell and growth control, etc. Additionally, since hepsin is a 
cell-surface protein, hepsin-functionaily disrupted cell lines can be combined with 
hepsin-complete cell lines, etc., to study the interactions and identify, e.g., the cell 
surface receptor for the hepsin gene product. These plus/minus screening methods 

25 can be adapted to any desired gene(s) or gene product(s). 

Accordingly, another aspect of the invention relates to a method of identifying an 
agent which modulates an activity of a mammalian gene product, e.g., a serine 
protease such as hepsin. By the phrase "an agent which modulates the activity," it is 
meant that the agent affects the activity. The activity can be affected by reducing it, 

30 eliminating it, activating it, increasing it, changing its specificity, etc. The agent can be 

an agonist, an antagonist, or a partial agonist or antagonist. The agent can be 
administered to a cell, an organ, a tissue, a whole organism, etc. Administration can be 
accomplished in various ways, e.g., by addition of it directly to a medium bathing or 
perfusing the cells, by encapsulating it in a liposome or a cyclodextran, etc., by 
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complexing it to another agent such as a chelating agent, DEAE, etc., by expressing it _ 
from a cell, by a virus, by injection, by inhalation, etc; Generally, the method 
comprises: (1) administering to a first cell in which expression of at least one copy of a 
target gene is functionally disrupted, (2) administering said agent to a second cell which 
contains at least one more copy of the first cell target gene whose expression is 
complete, and (3) measuring the amount of activity of the target gene in the first and 
second cell. The difference in the amount of activity can be used to determine whether 
the agent is selective for the target gene whose function is disrupted. By the phrase 
"expression is complete," it is meant, e.g., that the expression of the gene is wild-type 
or that the gene's product contains 1 or more desired activities. 

Gene functionally-disrupted cell lines can also be utilized to produce transgenic, 
either chimeric, heterozygous, or homozygous, animals, e.g., non-human mammals. 
Such transgenic animals are useful as models to study the physiological role of a 
desired gene and to identify agents which specifically target the desired gene or a 
biological pathway in which it acts. Thus, an aspect of the invention is method of 
administering to a mammal functionally-disrupted for a serine protease, e.g., hepsin, an 
amount of an agent effective to restore the serine protease, e.g., hepsin, function. 
Such agent can be, e.g., the gene, itself, e.g., as in administering the hepsin gene, an 
agonist, or an agent which acts upstream of the serine protease. Such animals are 
also useful for identifying the specific function and/or role of the knocked out gene; 
Hepsin-related disorders include ischemic heart disease, cardiovascular deficiency, 
tumor-related hypercoagulation, intravascular thrombosis, cell and growth control (e.g., 
cancer, hyper- or hypo-proliferative diseases, etc.), bone diseases. The use of animal 
models functionally disrupted for the hepsin gene is useful for identifying agents useful 
in treating such disorders, developing diagnostic agents and criteria for these disorders, 
etc. For example, hepsin-functionally-disrupted mammals, such as mice, are useful to 
identify blood plasma protein levels of hepsin and changes related to hepsin levels, for 
diagnostic and treatment purposes. A transgenic animal and a recombinant nucleic 
acid molecule according to the present invention is useful as described in U.S. Pat. 
Nos. 5,557,032, 5,532,158, 5,304,489, 5,221,778, 5,175,385, 5,175,384, 5,175,383, 
5,087,571, 5,082,779, 4,736,866, 4,873,191, and other transgenic animal patents. 

The invention also relates to the treatment of bone and skeletal disorders (e.g., 
osteoporosis, Paget's disease, osteitis deformans, etc.), especially those associated 
with elevated bone alkaline phosphase ("AP") levels. For example, a transgenic 
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homozygous hepsin "knock-out" animal in accordance with the present invention can be 
used as an animal model for a bone disease. These animals exhibit elevated levels of 
serum alkaline phosphatase ("AP"), especially AP of bone origin. See, e.g., Figs. 3 and 
4. By the phrase "elevated levels of serum AP", it is meant, e.g., an amount of 
5 circulating AP which is increased above the normal range established for animals of the 

same genetic background. The increase 

in AP is statistically significant, e.g., within 95%, preferably 99%, confidence levels as 
measured by a t-test. See, e.g., Figure 3. The amount of elevation can be, e.g., 5%, 
10%, 20%, 50%, etc., above normal. 

io In one embodiment, agents can be administered to transgenic animals to 

determine their effect on the treatment and/or prevention of bone disease. The effect 
can be monitored in various ways, including by direct inspection, e.g., bone X-ray, or 
indirectly by measuring circulating AP levels. The modulation in the AP level indicates 
an effect on the bone disease. The agents can be of any composition desired 

15 (polypeptide, nucleic acid, organic, inorganic, etc.). An agent can be combined with 

another agent known to effect the bone disease, e.g., calcitonin, bisphosphonates, 
gallium nitrate, etc. 

A knockout mouse in accordance with the present invention can also be used as 
a diagnostic tool to develop sensitive assays for alkaline phosphatase. Alkaline 

20 phosphatase is routinely tested and utilized as a clinical marker for a variety of 

diseases, including cancer, bone diseases. Thus t trangenic animals having elevated 
AP levels can be used to design and develop sensitive assays for AP detection. For 
example, the present invention can be utilized to identify agents which modulate 
alkaline phosphatase activity, e.g., increase its activity and therefore enhance its 

25 detection. 

The functionally-disrupted hepsin-gene can also be employed as a marker to 
identify mice which are otherwise undistinguishable. For example, if one desires to 
know the source of an animal (e.g., a mouse in a mixed cage of animals), genotype 
analysis can be performed (e.g., on blood, tail tissue, etc.) to determine whether any 

30 contain the hepsin-disrupted gene locus, thus identifying the animal's origin. 

The present invention also relates to a non-human transgenic animal, preferably 
a mammal, more preferably a mouse, which comprises a serine protease gene, 
preferably a hepsin gene, which has been engineered employing a recombinant nucleic 
acid according to the present invention. Generally, a transformed host cell, preferably a 

35 totipotent cell, whose endogenous gene has been modified using a recombinant nucleic 
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acid as described above is employed as a starting material for a transgenic embryo. 
The preferred methodology for constructing such a transgenic embryo involves 
transformed embryonic stem (ES) cells employing a targeting vector comprising a 
recombinant nucleic acid according to the invention. A particular gene locus, e.g. t 
hepsin, is modified by targeted homologous recombination in cultured ES cells 
employing a targeting vector comprising a recombinant nucleic acid according to the 
invention. The ES cells are cultured under conditions effective for homologous 
recombination. Effective conditions include any culture conditions which are suitable for 
achieving homologous recombination with the host cell chromosome, including effective 
temperatures, pH, medias, additives to the media in which the host cell is cultured (e.g., 
for selection, such as G418 and/or FIAU), cell densities, amounts of DNA, culture 
dishes, etc. Cells having integrated the targeting vector are selected by the appropriate 
marker gene present in the vector. After homologous recombination has been 
accomplished, the cells contain a chromosome having a recombinant gene. In a 
preferred embodiment, this recombinant gene contains a positive selectable marker 
gene fused to endogenous hepsin gene sequences. 

The transformed or genetically modified ES cells can be used to generate 
transgenic non-human mammals, e.g., mice, by injection into blastocysts and allowing 
the chimeric blastocysts to mature, following transfer into a pseudopregnant mother. 
See, e.g., Teratomacarcinoma and Embryonic Stem Cells: A Practical Approach, E.J. 
Robertson, ed., IRL Press. Various stem cells can be used, as known in the art, e.g.,- 
AB-1, HM-1 D3, CC1.2, E-14T62a, or RW4. Offpsring born to foster mothers may be 
screened initially for mosaic coat color where a coat color selection strategy has been 
employed. Alternatively, DNA from tail tissue of the offpsring can be used to screen for 
the presence of the DNA targeting vector. Offpsring that appear to be mosaics are then 
crossed to each other, if it believed they carry the modified gene in their germ line, in 
order to generate homozygotes for the modified gene. See, e.g, USP 5,557,032; USP 
5,532,158. 

In addition to the ES cell methods described here, transgenic animals can be 
created by other methods, e.g., by pronuclear injection of recombinant genes into 
pronuclei of one-cell embryos, incorporating an artificial yeast chromosome into 
embryonic stem cells, gene targeting methods, embryonic stem cell methodology. See, 
e.g., U.S. Patent Nos, 4,736,866; 4,873,191; 4,873,316; 5,082,779; 5,304,489; 
5,174,986; 5,175,384; 5,175,385; 5,221,778; Gordon et al., Proc. Natl. Acad. Sc/., 
77:7380-7384 (1980); Palmiter et aL, Ce//, 41:343-345 (1985); Palmiter et al., Ann. Rev. 



WO 98/54307 



- 18- 



PCT/EP98/03199 r 



Genet. 20:465-499 (1986); Askew et al., Mol. Cell Bio., 13:4115-4124 (1993); Games 
et al. Nature, 373:523-527 (1995); Valancius and Smithies, Mol. Cell. Bio., 1 1:1402- 
1408 (1991); Stacey et al., Mol. Cell. Bio., 14:1009-1016 (1994); Hasty et ah, Nature, 
350:243-246 (1995); Rubinstein etaL, Nucl. Acid Res., 21:2613-2617 (1993). 
5 As discussed, the invention relates to a transgenic mammal, such as a mouse, 

comprising cells which contain at least one functionally disrupted recombinant hepsin 
gene (e.g., heterozygous or homozygous) at a chromosomal hepsin gene locus. The 
cells can be created in accordance with the example below by inserting an exogenous 
nucleotide sequence into the hepsin gene. However, other methods can be used to 

10 create a functionally interrupted gene. For example, a termination codon can be 

inserted into the hepsin gene, using, e.g., a replacement type vector as described in 
Rubinstein et al.. Nucleic Acid Research, 21: 2613-2617, 1993 or a tag-and-exchange 
strategy as described in Askew et ah, MoL Cell. Bio., 13: 4115-4124, 1993, etc. 
Functional interruption of a hepsin gene can also be achieved classically by 

15 mutagenesis, such as chemical or radiation mutagenesis. 

A recombinant nucleic acid molecule according to the present invention can be 
introduced into any non-human mammal, including a mouse (Hogan et al., Manipulating 
the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1986), pig (Hammer et al., Nature, 315:343-345, 1985), sheep 

20 (Hammer et al., Nature, 315:343-345, 1985) f cattle, rat, or primate. See also, e.g., 

Church, Trends in Biotech. 5:13-19, 1987; Clark et al., Trends in Biotech. 5:20-24, 
1987; and DePamphiiis et al., BioTechniques, 6:662-680, 1988. In the examples below, 
a murine hepsin gene is modified by homologous recombination utilizing a gene 
targeting vector comprising regions of the murine hepsin gene. To carry out genetic 

25 modification of another mammalian hepsin gene, e.g., a rat or a primate, it may be 

desirable to obtain analogous regions of the target hepsin gene. A hepsin gene from 
another species, using a murine or human hepsin gene, can be accomplished by 
various methods known in the art, e.g., PCR using a mixture of oligonucleotides based 
on a consensus sequence or serine proteases and/or hepsins (e.g., Leytus et al. f 

30 Biochemistry, 27: 1067-1074, 1988), nucleic acid hybridization using oligonucleotides, 

cDNA, etc., at a desired stringency (e.g., Sambrook et al., Molecular Cloning, 1989, 
Chapter 11). 

A transgenic animal according to the present invention can comprise one or 
more genes which have been modified by genetic engineering. For example, a 
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transgenic animal comprising a hepsin gene which has been modified by targeted 
homologous recombination in accordance with the present invention can comprise 
other mutations, including modifications at other gene loci and/or transgenes. 
Modifications to these gene loci and/or introduction of transgenes can be accomplished 
in accordance with the methods of the present invention, or other methods as the 
skilled worker would know. For instance, double-mutants can be made by conventional 
breeding, i.e., crossing animals and selecting for a desired phenotype and/or genotype. 
In one embodiment of the invention, a transgenic animal can be constructed having at 
least one gene defective in hepsin (e.g., a knock-out) and one or more defective genes 
coding for a product utilized in the blood coagulation pathway, e.g., factor VII or factor 
IX. In a preferred embodiment, the latter genes are null or functionally-disrupted. Such 
an animal can be homozygous or heterozygous for the desired loci, or a combination 
thereof. 

The present invention also relates to a murine hepsin specific amino acid 
sequence selected from the sequence of amino acid 1 to 416 as set forth in Fig. 2. A 
murine hepsin specific amino acid sequence means a defined amino acid sequence 
which is found in the recited hepsin sequence but not in another amino acid sequence, 
especially not another hepsin sequence. A specific amino acid sequence can be found 
routinely, e.g., by searching a gene/protein database using the BLAST set of computer 
programs. A murine hepsin cDNA can be useful, e.g., as a probe to obtain murine 
genomic DNA or to produce peptides as antigens to generate an immune response 
specific for murine hepsin. Antibodies obtained by such immunization can be used as a 
specific probe for the murine hepsin for diagnostic or research purposes. Such peptides 
can also be used to inhibit a hepsin activity in cells or cell-free systems. 

A murine hepsin polypeptide, fragment, or murine hepsin polypeptide can aiso 
comprise various modifications, where such modifications include glycosylation, 
covalent modifications (e.g., of an R-group of an amino acid), amino acid substitution, 
amino acid deletion, or amino acid addition. Modifications to the polypeptide can be 
accomplished according to various methods, including recombinant, synthetic, 
chemical, etc. 

A polypeptide according to the present invention can be produced in an 
expression system, e.g., in vivo, in vitro, cell-free, recombinant, cell fusion, etc., 
according to the present invention. Modifications to the polypeptide imparted by such 
system include, glycosylation, amino acid substitution (e.g., by differing codon usage), 
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polypeptide processing such as digestion, cleavage, endopeptidase or exopeptidase 
activity, attachment of chemical moieties, including lipids, phosphates, etc. For 
example, some cell lines can remove the terminal methionine from an expressed 
polypeptide. 

A polypeptide according to the present invention can be recovered from natural 
sources, transformed host cells (culture medium or cells) according to the usual 
methods, including, ammonium sulfate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, hydroxyapatite chromatography and lectin 
chromatography. Protein refolding steps can be used, as necessary, in completing the 
configuration of the mature protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

For other aspects of the nucleic acids, polypeptides, antibodies, etc., reference 
is made to standard textbooks of molecular biology, protein science, and immunology. 
See. e.g., Davis et al. (1986), Basic Methods in Molecular Biology, Elsevir Sciences 
Publishing, Inc., New York; Hames et al. (1985), Nucleic Acid Hybridization, IL Press, 
Molecular Cloning, Sambrook et al.; Current Protocols in Molecular Biology, Edited by 
F.M. Ausubel et al., John Wiley & Sons, Inc; Current Protocols in Human Genetics, 
Edited by Nicholas C. Dracopoli et al., John Wiley & Sons, Inc.; Current Protocols in 
Protein Science; Edited by John E. Coligan et al., John Wiley & Sons, Inc.; Current 
Protocols in Immunology, Edited by John E. Coligan et al., John Wiley & Sons, Inc. 

The present invention also relates to a transgenic mouse comprising cells which contain 
at least one functionally disrupted recombinant hepsin gene at a chromosomal hepsin 
gene locus, wherein said disruption prevents functional expression of the hepsin 
polypeptide coded for by the hepsin gene, and wherein the serum alkaline phosphatase 
levels in the blood of said mouse is elevated above the normal range. Said disruption 
prevents expression of the hepsin polypeptide, and wherein ahead chomosomal hepsin 
gene locus in said cells is a functionally disrupted recombinant hepsin gene. 

The invention further comprises a transgenic mouse, wherein the cells, which contain at 
least one functionally disrupted recombinant hepsin gene, produce a truncated hepsin 
polypeptide comprising a transmembrane domain, further said functionally disrupted 
recombinant hepsin gene comprises a first selectable nucleotide sequence which 
confers a first selection characteristic on the cells in which it is present. 
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The present invention also provides a transgenic mouse comprising cells which lack 
exons 5-8 of at least one recombinant chromosomal hepsin gene, wherein said 
recombinant gene does not produce a functionally active hepsin polypeptide, and 
wherein the serum alkaline phosphatase levels in the blood of said mouse is elevated 
5 above the normal range. 



The instant invention also provides a method or process for producing a transgenic 
mouse having at least one functionally disrupted chromosomal hepsin gene, and 
wherein the serum alkaline phosphatase levels in the blood of said mouse is elevated 
10 above the normal range, comprising: 

(a) introducing a nucleic acid molecule of claim 1 into mouse ES cells; 

(b) culturing said ES cells under conditions effective for homologous 
recombination between said nucleic acid and a hepsin gene of the ES cells; 

(c) selecting cells having the recombinant nucleic acid molecule intregrated 
15 by homologous recombination into a chromosomal hepsin gene of the ES cells to obtain 

transformed ES cells; 

(d) introducing the transformed ES cells into a blastocyst; 

(e) implanting the blastocyst into a pseudopregnant mouse; and 

(f) allowing said embryo to develop to term, 

20 

The invention also provides a transgenic mouse produced according to said method or 
process. The method or process can be used for the identification of at least one 
transgenic embryo containing the recombinant nucleic acid molecule, and further for 
breeding an embryo upon maturation to form a transgenic line of mice comprising a 
25 functionally disrupted chromosomal hepsin gene. 

The invention further relates to a method or process of identifying an agent which 
modulates an activity of a mammalian hepsin comprising: 

administering an agent which modules an activity of a hepsin to a first cell in 
30 which expression of at least one copy of a hepsin gene is functionally disrupted, 

administering said agent to a second cell which contains at least one more copy 
than the first cell of an endogenous hepsin gene whose expression is complete, and 

measuring the amount of hepsin activity in the first and second cell. 
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EXAMPLES 

Cloning of mouse hepsin cDNA: 

Mouse hepsin cDNA was cloned using a PCR-based strategy. Total RNA was 
isolated from mouse liver with a CsCI gradient ultracentrifugation method (Sambrook et 
al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., 1989). Single-stranded cDNAs were synthesized from 12 pg 
total RNA using avian myeloblastosis virus reverse transcriptase and an oligo(dT) 
primer (cDNA Cycle Kit, Invitrogen). The cDNAs were used as templates for PCR 
amplification (30 cycles of 1.5-min annealing at 55 C, 1.5-min extension at 72 C, and 1- 
min denaturation at 94 C) with sense primer 5'-TTC GCC ATC TGC CAA GAC TG-3'(*) 
and anti-sense primer 5-GTG AGT CTTTAT GGC CTG GAA GAT-3'**) * T ^ e sequences 
of primers are based the published sequence of human heDsin cDNA (Levtus et al.. 
Biochemistry 27:1067-1074, 1988). (*) = Seq. ID, No, 3,**) = Seq. ID Ho, 4). 

A 785-bp DNA fragment amplified from the PCR reaction was cloned into the 
plasmid vector pCR (Invitrogen). Sequencing of the DNA fragment showed that it 
shared >90% sequence identity with human hepsin cDNA. Additional clones were 
obtained by radiolabeling of the cDNA fragment with [ 32 P]dCTP and screening of mouse 
liver cDNA (Wetsel et al, J. Biol. Chem. 265:2435-2440, 1990) or genomic libraries 
(Stratagene) or by using oligocucleotides to screen 5' rapid amplification of cDNA ends 
(RACE) libraries (Clontech). A full-length mouse cDNA sequence is shown in Fig. 2. 
Mouse and human hepsin cDNAs share over 84% sequence identities. 

Cloning of mouse genomic gene: 

To clone the mouse hepsin genomic gene, a partial murine cDNA was purified 
and radiolabeled with [ 32 P]dCTP using a random primer labeling kit (Amershan). The 
probe was used to screen a 129Sv mouse genomic library (Stratagene). Hybridization 
was performed with 2 x 10 6 cpm/ml radiolabeled probe, 40% formamide, 6 x SSC, 5 x 
Denhardt's solution, 0.1% SDS and 100 pg/ml salman sperm DNA. One positive clone 
was identified after screening of 8 x 10 4 phage clones. Lambda phage DNA from this 
clone was purified using a phage preparation kit from Stratagene. The genomic DNA 
insert was isolated by restriction enzyme digestion and subsequently cloned into 
plasmid vector Bluescript (Stratagene). 

The genomic structure of the mouse hepsin gene was determined by restriction 
enzyme digestion, hybridization with radiolabeled oligonucleotides, subcloning of DNA 
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fragments and sequencing. As shown in Fig. 1, the mouse hepsin gene spans -15 kb 
and contains at least 12 exons shown in black boxes. The three serine protease active " 
site residues His (H), Asp (D) and Ser (S) are encoded by exons 7, 8 and 11, 
respectively. Selected restriction enzyme sites are indicated in Fig. 1 by the vertical 
lines. 

Construction of the targeting vector: 

To construct the targeting vector, exons 5-8 of the hepsin gene were deleted 
and replaced by a neo gene cassette that serves as a positive selection marker (Fig. 1) 
(Capecchi, Science 244: 1288-1292, 1989). The deletion abolished the potential hepsin 
activation cleavage site as well as the protease catalytic site His and Asp residues. At 
the 3' end of the vector, a herpes simplex virus thymidine kinase (tk) gene cassette was 
also included to serve as a negative selection marker (Fig. 1) (Capaccni, Science 
244:1288-1292, 1989). The structure of the targeting vector was verified by restriction 
enzyme digestion and DNA sequencing. Large scale preparation of supercoiled 
plasmid DNA was performed using a CsCI gradient ultracentrifugation method 
(Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989). 

Disruption of the hepsin gene in mouse embryonic stem cells: 

Pluripotent mouse embryonic stem (ES) cells (RW4) were purchased from 
Genome Systems (St. Louis). ES cells were cultured in 100-mm petri dishes containing 
a layer of primary mouse embryonic fibroblasts that were pretreated with 1 0 pg/ml 
mitomycin C (Joyner A, Gene Targeting: A Practical Approach, IRL Press, N.Y., 1993). 
ES cell culture medium consists of DMEM (Gibco), 15% fetal bovine serum (Hyclone), 2 
mM L-glutamine, 1 mM sodium pyruvate, 0.1 mM non-essential amino acid, 20 mM 
Hepes, 11pM 2-mercaptoethanol, 50 pg/ml penicillin-streptomycin, and 10 3 U/ml 
leukemia inhibitory factor (LIF) (Gibco). 

The targeting vector was introduced into ES cells by electroporation using 
Electro Cell Manipulator (ECM 600, BTX Inc.) under conditions recommended by the 
manufacturer, A total of 1.5 x 10 7 ES cells was transfected with 30 pg of linearized 
vector DNA. Transfected ES cells were cultured in 100-mm petri dishes and selected in 
the presence of 250 pg/ml G418 (Gibco) and 2 p ganciclovir (Syntex Laboratories, 
Inc.). Over 700 stable transfected cell lines from the selection were obtained. Genomic 
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DNA from the cells was isolated and digested with restriction enzyme Hindlll. Southern 

analysis was performed with a mouse hepsin genomic probe flanking the hepsin gene 
deletion and a neo gene probe. 

Seven mutant ES cell lines were identified in which one copy of the endogenous 
5 hepsin gene was disrupted by homologous recombination. The disruption occurred 

only at the single locus and there was no additional neo gene insertion found in the 
genome. Karyotypic analysis (Chromosome Inc, St. Louis) showed that six cell lines 
had a normal male karyotype (40, XY). The predicted genomic structure in the mutant 
allele after homologous modification is shown in the lower panel of Fig. 1 . 

10 

Generation of chimeric mice and germline transmission of the mutant allele: 

To produce chimeric mice, four mutant ES cell lines were selected for blastocyst 
microinjection experiments that were performed by a commercial service, Stem Cell 
Gene Altered Mouse Core Service, University of Cincinnati (Cincinnati, OH). Mouse 

15 blastocysts were derived from wild-type C57BI/6J mice. About 10-15 hepsin mutant ES 

cells were injected into each blastocyst under a microscope. The blastocysts were 
further implanted into the uterus of pseudogregnant foster mothers. Implantation of 435 
blastocysts yielded 36 chimeric offspring. Ninety percent of the animals have >50% 
coat color chimerism, an indication of significant ES cell contribution in the animals. 

20 All chimeric mice were bred with NIH black swiss mice. Germline transmission 

was achieved with nine chimeric mice that were derived from injections of three 
independent cell lines. Breeding of these chimeric mice produced a total of 108 agouti 
pups. 

Genomic DNA was obtained from mouse tail biopsy samples using Puregene 
25 DNA Isolation kit (Gentra Systems, NC). Genotypes of the agouti mice were 

determined by a PCR-based method. In the PCR reaction two sets of oligonucleotide 
primers were included: one set specific for the neo gene (sense primer: 5-CAA GAT 
GGA TTG CAC GCA GG-3'; anti-sense primer: 5'-GTA AAG CAC GAG GAA GCG G- 
3') and another set specific for the hepsin gene (sense primer: 5'-ACG GCA CAT CGG 
30 GCT TCT TTT-3'; antisense primer: 5-AAT GCG GTC CAC CGG CAG CTT CCT-3'). 

Seq. ID Nos. 6 - Seq. ID Nos. 8). In each reaction, about 100 
ng of genomic DNA was used as templates for the amplification 
(30 cycles of 1.5-min annealing at 59 C, 1.5-min extension at 
72 C, and 1-min denaturation at 94 C) . The neo gene was iden^:.- 
tified in 49 agouti offspring, 
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indicating germiine transmission of the mutant hepsin allele. The results of PGR 
analysis were further confirmed by Southern analysis using the hepsin genomic probe. 

Generation of homozygous hepsin-deficient mice: 

Male and female heterozygous hepsin-deficient mice were bred and over 400 
offspring were produced. The genotype of these newborn mice was determined by the 
PCR analysis, as described the previous section. The PCR reaction allows us to 
determine genotypes of wild-type, heterozygous and homozygous mice. 

Northern analysis of hepsin mRNA expression: 

Total RNA was extracted from frozen liver tissue using the UltraspecTM RNA kit 
(Biotecx Lab. Inc., Houston, TX). Samples (15 mg) were fractionated on denaturing 
agarose gels, transferred onto nitrocellulose membranes and hybridized with 

32 

P-labeled mouse hepsin cDNA probes spanning the entire coding region. To 
determine mRNA expression of other liver specific genes, the blot was reprobed with a 
partial mouse prothrombin cDNA that was PCR-amplified based on the published 
sequence. Degen, S.J.F., Schaefer, LA., Jamison, C.S., Grant, S.G. Fitzgibbon, JJ. 
Pai, J.-A., Chapman, V.M., & Elliott, R.W. (1990) DNA & Cell BioL % 9, 487-498. 

Hemostasis assays and blood chemistry analysis: 

Blood samples were drawn from the inferior vena cava into one-tenth volume of 
3.8% sodium citrate. Blood cell counts and hematocrit were analyzed using a 
hemostasis instrument (9210 CP Baker System). Clotting assays (aPTT and PT) were 
performed with platelet poor plasma using the MLA Electra 900C programmable 
coagulometer. In the tail bleeding assay, mice were anesthetized with pentobarbital (90 
mg/kg). The tip of the mouse tail (1 mm in diameter) was cut. Tails were soaked 
immediately in 0.9% NaCI saline prewarmed at 37°C and the time until bleeding 
stopped completely was recorded. For blood chemistry analysis, serum samples were 
prepared from clotted whole blood. Chemistry analysis was performed by Consolidated 
Veterinary Diagnostics, Inc., West Sacramento, CA. Alkaline phosphatase isoenzymes 
were analyzed by a quantitative method described by Koffmann et aL Koffmann, W.E., 
Everds, N. t Pignatello, M., & Solter, P.F. (1994) Toxicol, Pathol. 22, 633-638. 
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LPS-induced septic shock and thromboplastin-induced DIC: In the 

lipopolysaccharide (LPS)-induced septic shock model, mice (13-14 weeks old) were 
injected intraperitoneaily with 40 mg/kg of LPS from E. coli serotype 01 1 1 :B4 
(Calbiochem). The mice were monitored for signs of endotoxemia and lethality 4 times 

5 a day for the first 3 days and periodically thereafter. In the thromboplastin-induced 

disseminated intravascular coagulation (DIC) model, anesthetised mice were injected 
intravenously with human placental thromboplastin (Behring, Marburg, Germany). The 
linear range of the thromboplastin dose was determined experimentally. The time in 
minutes was recorded between the injection of thromboplastin and acute death as 

10 measured by cessation of breathing or heart beating. 

Histological analysis: 

Mice were sacrificed under C0 2 narcosis and tissues were fixed by perfusion 
through the heart with an aldehyde fixative consisting of 4% (w/v) paraformaldehyde, 
15 3% (w/v) sucrose, in 0.1 M phosphate buffer, pH 7.3. Tissues were dehydrated in 

graded ethanol solutions and embedded in paraffin blocks. Tissue sections were cut at 
5 pm thickness, stained with hematoxylin and eosin (H & E), and examined by light 
microscopy. 



Disruption of the hepsin gene in ES cells and generation of chimeric mice: 

An 18-kb DNA fragment that contained the mouse hepsin gene was isolated 
from a 129Sv genomic library. The hepsin gene contains at least 12 exons (Fig. 1). 
The serine protease active site residues, His, Asp, and Ser are encoded by exons 7, 8, 
and 1 1 , respectively. A replacement targeting vector was constructed by deleting the 
Kpn I/Bam HI fragment containing exons 5-8 that code for the conserved activation site 
and most of the catalytic domain (Fig. 1). Over 700 stable ES cell clones transfected 
with the targeting vector were analyzed by Southern blotting. Seven ES clones were 
identified in which one copy of the endogenous hepsin gene was disrupted by 
homologous recombination. Four mutant ES clones were microinjected into blastocysts 
from C57BL/6J mice to generate chimeric mice. Germline transmission was achieved 
with nine chimeric mice derived from three independent cell lines. The results of 
phenotypical analysis were obtained from mice derived from all these three ES cell 
lines. 
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Viability, growth and fertility of homozygous hepsin-/- mice: 

Breeding of hepsin+/- mice produced over 500 offspring. Genotypes of F2 
progeny were determined by PCR analysis and the results were confirmed by Southern 
blotting in randomly selected samples. Wild-type, hepsin+/- and hepsin-/- mice 
represented 24.3%, 50.0% and 25.7% of progeny, respectively. The observed 
Mendelian pattern of inheritance indicates that homozygous hepsin-deficiency is 
compatible with normal embryonic development. Male to female ratio was 1:1 in 
wild-type, hepsin+A- and hepsin-/- mice. The absense of hepsin mRNA expression in 
liver samples from hepsin-/- mice was demonstrated by Northern analysis using cDNA 
probes spanning the entire coding region. As a control, the expression of prothrombin 
mRNA in the liver from hepsin-/- mice was not significantly affected. 

When followed to 9 months, hepsin-/- mice appeared to develop normally and 
exhibited similar body weight gain as wild-type and hepsin+/- litter mates. No obvious 
physical abnormalities were observed in hepsin-/- mice. In hematologic examinations, 
values for red blood cell, white blood cell and platelet counts, hematocrit and 
hemoglobin were similar in hepsin-/- mice and wild-type controls. Both male and 
female hepsin-/- mice were fertile and produced viable offspring. 

Tail bleeding and plasma clotting assays in hepsin-/- mice: 

To evaluate the physiological importance of hepsin function in blood 
coagulation, tail bleeding time and plasma clotting times (aPTT and PT) were 
determined in hepsin-/- mice. The mouse tail bleeding time measures both platelet and 
blood clotting activities, whereas aPTT and PT assays measure the function of the 
intrinsic and extrinsic blood coagulation pathways, respectively. As shown in Table 1, 
no prolongation in time was detected in hepsin-/- mice in these three assays, indicating 
that hepsin deficiency did not adversely affect hemostasis. These results are 
consistent with the observation that no spontaneous bleedings occurred in hepsin-/- 
mice. 

Thromboplastin-induced DIC and LPS-induced septic shock models: 

To examine the potential function of hepsin under pathological conditions, 
hepsin-/- mice were challenged in DIC and septic shock models. In the DIC model, 
thromboplastin was injected intravenously. The times at which breathing or heart 
beating stopped were similar in both hepsin-/- mice and wild-type controls (Table 1 ), 
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indicating that the tissue factor-dependent coagulation pathway was not altered in 
hepsin-/- mice. This was supported by the results from a septic shock model, in which 
survival rates of hepsin-/- and wild-type mice after injection of a high dose of LPS were 
not significantly different. 

Blood chemistry analysis for liver function: 

In addition to blood coagulation, hepsin was reported to be required for growth 
of hepatocytes in vitro. Torres-Rosado, A., O'Shea, K.S., Tsuj.i, A., Chou, S.-H., & 
Kurachi, K. (1993) Prac. Nati. Acad. ScL USA 90, 7181-7185. To examine the effects 
of hepsin deficiency on liver function, blood samples from hepsin-deficient mice were 
analysed. As shown in Table 2, serum concentrations of alanine aminotransferase 
(SGPT), g-glutamyltranspeptidase (GGT), bilirubin, albumin, globulin, and total protein 
were similar in hepsin-/- and wild-type mice, and values were within the normal ranges. 
Interestingly, serum alkaline phosphatase levels were approximately 2-fold higher in 
hepsin-/- mice when compared to wild-type litter mates (Fig. 3). The elevation of serum 
alkaline phosphatase was found in both male and female hepsin-/- mice at different 
ages (Fig. 3). Alkaline phosphatase isoenzymes were analyzed, and results indicated 
that the elevated alkaline phosphatase in the serum from hepsin-/- mice was mostly of 
bone origin (Fig. 4). . 

Morphologic and histologic analyses: 

Necropsies and histologic examinations were performed on hepsin-/- mice at 
various ages. No gross abnormalities were indentified in major organs including liver, 
brain, lung, heart, kidney, pancreas, spleen and bone. Livers from hepsin-/- and 
wild-type mice appeared similar and had comparable weights (Table 2). In 
hemtoxylin/eosin-stained sections including liver and bone from hepsin-/- mice were 
histologically indistinguishable from those of wild-type mice (data not shown). X-ray 
examinations were also performed to detect potential defects in the skeleton. No 
structural abnormalities were found in long bones, pelvis and vertebrae in hepsin-/- 
mice at the age of 5 months (data not shown). 

Summary of Results 

Human recombinant hepsin expressed on the surface of BHK cells has been 
reported to specifically activate FVIl, Kazama, Y. T Hamamoto, T., Foster, D.C., and 
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Kisiel, W., J. Biol. Chem., 270:66-72 (1995), an essential enzyme that acts early in the_ 
tissue factor-dependent coagulation pathway, Wildgoose, P., & Kisiel, W., Blood, 
73:1888-1895 (1989). This finding was confirmed in our own studies (data not shown), 
indicating a function of hepsin in the initiation of blood coagulation. Genetic and 
site-directed mutagenesis studies show that the activation cleavage of the Age152-lle 
peptide bond in FVII is required to convert the zymogen to an active enzyme, Broze, 
G.J., & Leykam, J.E., Schwartz, B.D., and Miletich, J. P., J. Biol. Chem., 260:10917- 
10920 and Wildgoose, P. & Kisiel, W M Blood, 73:1888-1895, 1989. The mechanism of 
FVII activation under physiological conditions is not completely understood. In addition 
to hepsin, factors IXa, Xa and Xlla and thrombin have been reported to activate FVII in 
vitro, Kisiel, W., Fujikawa, K., Davie, E.W., Biochemistry, 16:4189-4194 (1977); 
Radcliffe, R., Bagdasarian, A., Colman, R., Nemerson, Y. t Blood, 50:611-617 (1977); 
Broze, G.J., Leykam, J.E., Schwartz. B.D., Miletich, J. P., J. Biol. Chem., 260:10917- 
10920 (1985); Wildgoose, P., Nemerson, Y., Hansen, L.L, Nielsen, F.E., Glazer, S., 
and Hedner, U. f Blood, 80:25-28 (1992); and Eichinger, S., Mannucci, P.M., Tradati, F., 
Arbini, A.A., Rosenberg, T.D., and Bauer, K.A., Blood, 86:3021-3025 (1995). Recent 
studies with haemophilia patients suggest that FIXa is primarily responsible for the 
basal levels of FVIIa in vivo, Miller, G., Martin, J.C., Mitropoulos, K.A., Esmouf, M.P., 
Cooper, J.A., Morrissey, J.H., Howarth, D.J., and Tuddenham, E.G.D., Blood, 87:4187- 
4196 (1996); Neuenschwander, P.F. and Morrissey, J.H., J. Biol. Chem., 267:14477- 
14482 (1992); Morrissey, J.H., Macik, G., Neuenschwander, P.F. and Company, P.C., 
Blood, 81:734-744 (1993); Fenton, J.W., Ann. N.Y. Acad. Sc/., 485:5-15 (1986); 
Nakamura, T., Nishizawa, T., Hagiya, M., Seki, T., Shimonishi, M., Sugimura, A., 
Tashiro, K., and Shimizu, S., Nature, 342:440-443 (1989); Manfioletti, G., Brancolini, C, 
Avanzi, G., and Schneider, C, Moi Cell. Biol, 13:4976-4985 (1993). A hepsin knock- 
out mice in accordance with the present invention is useful to study the above- 
mentioned pathway, e.g., by preparing animals homozygous or heterozygous for the 
hepsin knock-out in combination with one or more mutations in any of the above- 
mentioned genes. 

Without further elaboration, it is believed that one skilled in the art can, using the 
preceding description, utilize the present invention to its fullest extent. The preceding 
preferred specific embodiments are, therefore, to be construed as merely illustrative, 
and not limitative of the remainder of the disclosure in any way whatsoever: 
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The entire disclosure of all applications, patents and publications, cited above 
and in the figures are hereby incorporated by reference. 

From the foregoing description, one skilled in the art can easily ascertain the 
essential characteristics of this invention, and without departing from the spirit and 
scope thereof, can make various changes and modifications of the invention to adapt it 
to various usages and conditions. 



Functional hemostatis assays in wild-type and hepsin-deficient mice. 





Hepsin 


Hepsin 


Tail bleeding time (sec.) 


70.6 ± 19.7 


77.3 ± 28.7 




(n=5) 


(n=7) 


PT (sec.) 


20.9 ±6.6 


22.6 ± 4.5 




(n=6) 


(n=5) 


aPTT (sec.) 


37.2 ±11.2 


38.4 ± 9.1 




(n=9) 


(n=6) 


Thromboplastin-induced DIC: 






cessation of breathing (min.) 


1.30± 0.16 


1.42 ±0.42 




(n=5) 


(n=6) 


cessation of heart beating (min.) 


1.86 ±0.34 


1.97 ± 0.45 




(n=5) 


(n=6) 



The tail bleeding time, plasma prothrombin time (PT) and activated partial 
thromboplastin time (aPTT) were performed with 9-12 weeks old mice, as described 
under "Materials and Methods." In the thromboplastin-induced DIC model, acute death 
was ascertained as cessation of either breathing or heart beating. Data are presented 
as mean ± SD and the number of mice in each experimental group is indicated in 
parentheses. 
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Liver weight and blood chemistry analysis for liver function. 





nepsin 


Hepsin ''* 


i— ivci wciLjiH ) 


0.96 ± 0.13 


0.92 ± 0.08 




(n=6) 


(n=5) 


SGPT HU/U 


25.75 ± 8.27 


31.38 ± 7.86 




/-, oo \ 

(n-28) 


(n=29) 


GGT (\U/L) 


0.71 ± 0.75 


1.0 ± 0.58 




(rw) 


(n=7) 


Tntal nrntp»in /n/Hh 


5.48 ± 0.28 


4.87 ± 0.43 




(n-18) 


(n=13) 


Albumin (n/ri\\ 


2.81 ± 0.24 


2.32 ± 0.15 




(n-18) 


(n=13) 


Globulin (g/dl) 




2.55 ± 0.31 




(n=18) 


(n=13) 


Total bilirubin (mg/dl) 


0.17 ± 0.07 


0.13 ±0.07 




(n=14) 


(n=11) 



Wild-type and hepsin mice (8-12 weeks old) were sacrificed under C0 2 
narcosis. The liver was dissected out and weighed. Blood chemistry analysis was 
performed as described under "Materials and Methods." Data are presented as mean ± 
SD and the number of mice in each experimental group is indicated in parentheses. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

APPLICANT: 1. SCHERING AKT I ENGES ELLS CHAFT 
2. Washington University 



(ii) TITLE OF INVENTION: KNOCKOUT MICE AND THEIR PROGENY WITH 
DISRUPTED SERINE PROTEASE GENES 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sobering Aktiengesel lschaf t 

(B) STREET: Mullers trafie 178 

(C) CITY: Berlin 

(D) STATE : Berlin 

(E) COUNTRY: Germany 

(F) ZIP: D-13353 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/000,846 
-(B) FILING DATE: 30-DEC-1997 
(C) CLASSIFICATION: 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 030-4 681-2085 

(B) TELEFAX: 030-4681-2058 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

• (A) ORGANISM: Mus musculus 

(ix) FEATURE: 
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(A) NAME /KEY : CDS 

(B) LOCATION: 35.. 1282 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCAACCTGG GAATCATTAA CAAGAGTCCC TGAC ATG GCG AAG GAG GGT GGC 52 

Met Ala Lys Glu Gly Gly 
1 5 

CGG ACT GCA GCA TGC TGC TCC AGA CCC AAG GTG GCA GCT CTC ATT GTG 100 
Arg Thr Ala Ala Cys Cys Ser Arg Pro Lys Val Ala Ala Leu He Val 
10 15 20 

GGT ACC CTG CTG TTC CTG ACA GGC ATT GGG GCC GCG TCC TGG GCC ATT 148 
Gly Thr Leu Leu Phe Leu Thr Gly He Gly Ala Ala Ser Trp Ala He 
25 30 35 

GTG ACC ATC CTA CTG CAG AGT GAC CAG GAG CCA CTG TAC CAA GTG CAG 196 
Val Thr He Leu Leu Gin Ser Asp Gin Glu Pro If-u Tvr Gin Val Gin 
40 45 50 

CTC AGT CCA GGG GAC TCA CGG CTT GCG GTG TTT GAC AAG ACG GAG GGA 24 4 
Leu Ser Pro Gly Asp Ser Arg Leu Ala Val Phe Asp Lys Thr Glu Gly 
55 60 65 70 

ACG TGG AGG CTA CTG TGC TCC TCA CGC TCC AAT GCC AGG GTG GCA GGG 2 92 

Thr Trp Arg Leu Leu Cys Ser Ser Arg Ser Asn Ala Arg Val Ala Gly 

75 80 85 

CTC GGC TGT GAG GAG ATG GGC TTT CTC AGG GCT CTG : GCG CAC TCG GAG 340 
Leu Gly Cys Glu Glu Met Gly Phe Leu Arg Ala Leu Ala His Ser -Glu 
90 95 100 

CTG GAT GTC CGC ACT GCG GGC GCC AAC GGC ACA TCG GGC TTC TTT TGC 38 8 

Leu Asp Val Arg Thr Ala Gly Ala Asn Gly Thr Ser Gly Phe Phe Cys 
105 110 115 

GTG GAC GAG GGC GGA CTC CGT CTG GCT CAG AGG TTG CTG GAT GTC ATC 4 36 

Val Asp Glu Gly Gly Leu Arg Leu Ala Gin Arg Leu Leu Asp Val lie 
120 125 130 

TCT GTA TGT GAC TGT CCT AGA GGC CGA TTC CTG ACT GCC ACC TGC CAA 4 84 

Ser Val Cys Asp Cys Pro Arg Gly Arg Phe Leu Thr Ala Thr Cys Gin 
!35 140 145 150 

GAC TGT GGC CGC AGG AAG CTG CCG GTG GAC CGC ATT GTG GGG GGC CAG 532 
Asp Cys Gly Arg Arg Lys Leu Pro Val Asp Arg He Val Gly Gly Gin 

155 160 165 

GAC AGC AGT CTG GGA AGG TGG CCG TGG CAG GTC AGC CTG CGT TAT GAT 580 
Asp Ser Ser Leu Gly Arg Trp Pro Trp Gin Val Ser Leu Arg Tyr Asp 
170 175 180 

GGG ACC CAC CTC TGT GGG GGG TCC CTG CTG TCT GGG GAC TGG GTG CTG 62 8 

Gly Thr His Leu Cys Gly Gly Ser Leu Leu Ser Gly Asp Trp Val Leu 
185 190 195 
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ACT GCT GCA CAT TGC TTT CCA GAG CGG AAC CGG GTC CTG TCT CGG TGG 67 6 
Thr Ala Ala His Cys Phe Pro Glu Arg Asn Arg Val Leu Ser Arg Trp 
200 205 210 

CGA GTA TTT GCT GGT GCT GTA GCC CGG ACC TCA CCC CAT GCT GTG CAA 724 
Arg Val Phe Ala Gly Ala Val Ala Arg Thr Ser Pro His Ala Val Gin 
215 220 225 230 

CTG GGG GTT CAG GCT GTG ATC TAT CAT GGG GGC TAC CTT CCC TTT CGA 77 2 
Leu Gly Val Gin Ala Val lie Tyr His Gly Gly Tyr Leu Pro Phe Arg 
235 240 245 

GAC CCT ACT ATC GAC GAA AAC AGC AAT GAC ATT GCC TTG GTC CAC CTC 82 0 

Asp Pro Thr lie Asp Glu Asn Ser Asn Asp lie Ala Leu Val His Leu 
250 255 260 

TCT AGC TCC CTG CCT CTC ACA GAA TAC ATC CAG CCA GTG TGT CTC CCT 8 68 

Ser Ser Ser Leu Pro Leu Thr Glu Tyr lie Gin Pro Val Cys Leu Pro 
265 270 275 

GCT GCG GGA CAG GCC CTG GTG GAT GGC AAG GTC TGT ACT GTG ACC GGC 916 
Ala Ala Gly Gin Ala Leu Val Asp Gly Lys Val Cys Thr Val Thr Gly 
280 285 290 

TGG GGT AAC ACA CAG TTC TAT GGC CAA CAG GCT ATG GTG CTC CAA GAG 964 
Trp Gly Asn Thr Gin Phe Tyr Gly Gin Gin Ala Met Val Leu Gin Glu 
295 300 305 310 

GCC CGG GTT CCC ATC ATA AGC AAC GAA GTT TGC AAC AGC CCC GAC TTC 1012 
Ala Arg Val Pro lie lie Ser Asn Glu Val Cys Asn Ser Pro Asp Phe 

315 320 325 

TAC GGG AAT CAG ATC AAG CCC AAG ATG TTC TGT GCT GGC TAT CCT GAG 10 60 
Tyr Gly Asn Gin lie Lys Pro Lys Met Phe Cys Ala Gly Tyr Pro Glu 
330 335 340 

GGT GGC ATT GAT GCG TGC CAG GGC GAC AGT GGA GGC CCC TTT GTG TGT 1108 
Gly Gly lie Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Phe Val Cys 
345 350 355 

GAA GAC AGC ATC TCT GGG ACA TCA AGG TGG CGG CTA TGT GGC ATT GTA 1156 
Glu Asp Ser lie Ser Gly Thr Ser Arg Trp Arg Leu Cys Gly lie Val 
360 365 370 

AGC TGG GGT ACG GGC TGT GCT TTG GCC CGG AAG CCA GGA GTG TAC ACC 12 0 4 
Ser Trp Gly Thr Gly Cys Ala Leu Ala Arg Lys Pro Gly Val Tyr Thr 
375 380 385 390 

AAA GTC ACT GAC TTC CGG GAG TGG ATC TTC AAG GCC ATA AAG ACT CAC 12 52 
Lys Val Thr Asp Phe Arg Glu Trp lie Phe Lys Ala lie Lys Thr His 
395 400 405 

TCC GAA GCC AGT GGC ATG GTG ACT CAG CCC TGATCCCGCC TCATCTCGCT 1302 
Ser Glu Ala Ser. Gly Met Val Thr Gin Pro 
410 415 
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GCTCCGTGCT GCACTAGCAT CCAGAGTCAG AGTTGGTCTG GTGGCTCCAG CCCCACGTGG 1362 

TAGGCTCCAC ACTGGGCCTC ACATGGAATG GTTTCCTGCT CAGATCCAGT CCACGGGTCC 1422 

AAGGATGCTG GATCCAAGGA CTTCTCTTCC ACAGTGGCCG GCCCACTCAA TCCCAGGGCC 14 82 

ATTGGCCTCA CCCTCCCACC CCATGTAAAT ATTACTCTGT CCTCTGGGGG GCGCTCTAGG 154 2 

GAGCCCTTGT GCAGATGCTC TTTAAATAAT AAAGGTGGTT TTGATTAAAA AAAAAAAAAA 1602 

1605 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala Lys Glu Gly Gly Arg Thr Ala Ala Cys Cys Ser Arg Pro Lys 
15 10 15 

Val Ala Ala Leu lie Val Gly Thr Leu Leu Phe Leu Thr Gly He Gly 
20 25 30 

Ala Ala Ser Trp- Ala He Val Thr He Leu Leu . Gin. Ser . Asp . Gin Glu 
35 40 45 - 

Pro Leu Tyr Gin Val Gin Leu Ser Pro Gly Asp Ser Arg Leu Ala Val 
50 55 60 

Phe Asp Lys Thr Glu Gly Thr Trp Arg Leu Leu Cys Ser Ser Arg Ser 
^5 70 75 80 

Asn Ala Arg Val Ala Gly Leu Gly Cys Glu Glu Met Gly Phe Leu Arg 

85 90 95 

Ala Leu Ala His Ser Glu Leu Asp Val Arg Thr Ala Gly Ala Asn Gly 
100 105 110 

Thr Ser Gly Phe Phe Cys Val Asp Glu Gly Gly Leu Arg Leu Ala Gin 
115 ' 120 125 

Arg Leu Leu Asp Val He Ser Val Cys Asp Cys Pro Arg Gly Arg Phe 
130 135 140 

Leu Thr Ala Thr Cys Gin Asp Cys Gly Arg Arg Lys Leu Pro Val Asp 
145 150 155 160 

Arg lie Val Gly Gly Gin Asp Ser Ser Leu Gly Arg Trp Pro Trp Gin 

165 170 - 175 
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Val Ser Leu Arg Tyr Asp Gly Thr His Leu Cys Gly Gly Ser Leu Leu 
180 185 190 

Ser Gly Asp Trp Val Leu Thr Ala Ala His Cys Phe Pro Glu Arg Asn 
195 200 205 

Arg Val Leu Ser Arg Trp Arg Val Phe Ala Gly Ala Val Ala Arg Thr 
210 215 220 

Ser Pro His Ala Val Gin Leu Gly Val Gin Ala Val He Tyr His Gly 
225 230 235 240 

Gly Tyr Leu Pro Phe Arg Asp Pro Thr He Asp Glu Asn Ser Asn Asp 
245 250 255 

He Ala Leu Val His Leu Ser Ser Ser Leu Pro Leu Thr Glu Tyr He 
260 265 270 

Gin Pro Val Cys Leu Pro Ala Ala Gly Gin Ala Leu Val Asp Gly Lys 
275 280 285 

Val Cys Thr Val Thr Gly Trp Gly Asn Thr Gin Phe Tyr Gly Gin Gin 
290 295 300 

Ala Met Val Leu Gin Glu Ala Arg Val Pro He He Ser Asn Glu Val 
305 310 315 320 

Cys Asn Ser Pro Asp Phe Tyr Gly Asn Gin lie Lys Pro Lys Met Phe 

325 330 335 

Cys Ala Gly Tyr Pro Glu Gly Gly He Asp Ala Cys Gin Gly Asp Ser 
340 345 350 

Gly Gly Pro Phe Val Cys Glu Asp Ser He Ser Gly Thr Ser Arg Trp 
355 360 365 

Arg Leu Cys Gly He Val Ser Trp Gly Thr Gly Cys Ala Leu Ala Arg 
370 375 380 

Lys Pro Gly Val Tyr Thr Lys Val Thr Asp Phe Arg Glu Trp He Phe 
385 390 395 400 

Lys Ala He Lys Thr His Ser Glu Ala Ser Gly Met Val Thr Gin Pro 

405 410 415 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SENSE PRIMER 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TTCGCCATCT GCCAAGACTG 2 0 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

iii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "ANTISENSE PRIMER" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTGAGTCTTT ATGGCCTGGA AGAT ' 2 4 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) ' TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SENSE PRIMER" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CAAGATGGAT TGCACGCAGG 
(2) INFORMATION FOR SEQ ID NO: 6: 



20 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "ANTISENSE PRIMER 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GTAAAGCACG AGGAAGCGG 19 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SENSE PRIMER" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

ACGGCACATC GGGCTTCTTT T 21 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
*{A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "ANTISENSE PRIMER" 

(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: 



YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
ATTGCGGTCC ACCGGCAGCT TCCT 
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What is claimed : 

1 . A recombinant nucleic acid molecule comprising: 

a 5' nucleotide sequence which is effective to achieve homologous 
recombination at a first predefined position of a mammalian hepsin gene, operably 
linked to the 5* terminus of a first selectable nucleotide sequence which confers a first 
selection characteristic on a cell in which it is present, 

and a 3' nucleotide sequence which is effective to achieve homologous 
recombination at a second predefined position of the mammalian hepsin gene, operably 
linked to the 3' terminus of the first selectable nucleotide sequence. 

2. A recombinant nucleic acid molecule of claim 1 , further comprising a 
second selectable nucleotide sequence which confers a second selection characteristic 
on a cell in which it is present, 

which second selectable nucleotide sequence is 5* to the 5* nucleotide sequence 
effective to achieve homologous recombination, or which is 3' to the 3' nucleotide 
sequence effective to achieve homologous recombination. 

3. A recombinant nucleic acid molecule of claim 1 which is effective for 
disrupting expression of a murine hepsin gene. 

4. A recombinant nucleic acid molecule of claim 1 , wherein the first 
selection characteristic is a positive selection characteristic. 

5. A recombinant nucleic acid molecule of claim 1, wherein the first 
selection characteristic is coded for by a neomycin resistance gene. 

6. A recombinant nucleic acid molecule of claim 2, wherein the second 
selection characteristic is a negative selection characteristic. 

7. A recombinant nucleic acid molecule of claim 6, wherein the second 
selection characteristic is coded for by a thymidine kinase gene. 
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8. A recombinant nucleic acid molecule of claim 1 , further comprising a 
vector nucleotide sequence. ~~ 

9. A recombinant nucleic acid molecule of claim 1 1 wherein the nucleic acid 

is DNA. 

1 0. A recombinant nucleic acid molecule of claim 1 , wherein the 5' 
nucleotide sequence effective for recombination comprises all or part of exons 9-12 of a 
mouse hepsin gene and the 3* nucleotide sequence effective for recombination 
comprises exon 4 of a mouse hepsin gene, whereby the molecule is effective for 
disrupting expression of the mouse hepsin gene. 

11. A recombinant nucleic acid molecule of claim 1 , whereby the molecule is 
effective for replacing exons 5-8 of a mouse hepsin gene with the first selectable 
nucleotide sequence. 

12. A transformed mammalian cell comprising a recombinant nucleic acid of 

claim 1 . 

13. A transformed mouse cell-comprising a recombinant nucleic acid of 
claim 1 , wherein the hepsin gene sequences are murine. 

14. A transformed mouse cell of claim 13, wherein the mouse cell is an 
embryonic stem cell. 

15. A transformed mouse cell of claim 13, wherein the nucleic acid is 
integrated into the chromosome of the mouse cell at the mouse hepsin gene locus. 

16. A transgenic mouse comprising cells which contain at least one 
functionally disrupted recombinant hepsin gene at a chromosomal hepsin gene locus, 
wherein said disruption prevents functional expression of the hepsin polypeptide coded 
for by the hepsin gene, and wherein the serum alkaline phosphatase levels in the blood 
of said mouse is elevated above the normal range. 
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17. A transgenic mouse of claim 16, wherein said disruption prevents 
expression of the hepsin polypeptide. ~~ 

18. A transgenic mouse of claim 17, wherein at each chromosomal hepsin 
gene locus in said cells is a functionally disrupted recombinant hepsin gene. 

19 A transgenic mouse of claim 16, wherein the cells, which contain at least 
one functionally disrupted recombinant hepsin gene, produce a truncated hepsin 
polypeptide comprising a transmembrane domain. 

20 A transgenic mouse of claim 16, wherein said functionally disrupted 
recombinant hepsin gene comprises a first selectable nucleotide sequence which 
confers a first selection characteristic on the cells in which it is present. 

21 . A transgenic mouse comprising cells which lack exons 5-8 of at least one 
recombinant chromosomal hepsin gene, wherein said recombinant gene does not 
produce a functionally active hepsin polypeptide, and wherein the serum alkaline 
phosphatase levels in the blood of said mouse is elevated above the normal range. 

22 , a method for producing a transgenic mouse having at least one 
functionally disrupted chromosomal hepsin gene, and wherein the serum alkaline 
phosphatase levels in the blood of said mouse is elevated above the normal range ; 
comprising: 

(a) introducing a nucleic acid molecule of claim 1 into mouse ES cells; 

(b) culturing said ES cells under conditions effective for homologous 
recombination between said nucleic acid and a hepsin gene of the ES cells; 

(c) selecting cells having the recombinant nucleic acid molecule integrated 
by homologous recombination into a chromosomal hepsin gene of the ES cells to obtain 
transformed ES cells; 

(d) introducing the transformed ES cells into a blastocyst; 

(e) implanting the blastocyst into a pseudopregnant mouse; and 

(f) allowing said embryo to develop to term. 

23 . A transgenic mouse produced according to claim 22 . 
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24 . A method of claim 22 further comprising identifying at least one 
transgenic embryo containing the recombinant nucleic acid molecule. 

25 . A method of claim 22, further comprising breeding an embryo upon 
maturation to form a transgenic line of mice comprising a functionally disrupted 
chromosomal hepsin gene. 

26 . A method of identifying an agent which modulates an activity of a 
mammalian hepsin comprising: 

administering an agent which modules an activity of a hepsin to a first cell in 
which expression of at least one copy of a hepsin gene is functionally disrupted, 

administering said agent to a second cell which contains at least one more copy 
than the first cell of an endogenous hepsin gene whose expression is complete, and 

measuring the amount of hepsin activity in the first and second cell. 

2 7. An isolated nucleic acid comprising a nucleotide sequence coding for a 
mouse hepsin polypeptide. 

2 8. An isolated nucleic acid of claim 2 7 wherein said nucleic acid has a 
sequence as set forth in Fig. 2. 
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GTCAACCTGGGAATCATTAACAAGAGTCCCTGACATGGCGAAGGAGGGTGGCCGGACTGC 

MAKEGGRTA 

AGCATGCTGCTCCAGACCCAAGGTGGCAGCTCTCATTGTGGGTACCCTGCTGTTCCTGAC 
ACCSRPKVAALIVGTLLFLT 

AGGCATTGGGGCCGCGTCCTGGGCCATTGTGACCATCCTACTGCAGAGTGACCAGGAGCC 
GIGAASWAIVTILLQSDQEP 

ACTGTACCAAGTGCAGCTCAGTCCAGGGGACTCACGGCTTGCGGTGTTTGACAAGACGGA 
LYQVQLSPGDSRLAVFDKTE 

GGGAACGTGGAGGCTACTGTGCTCCTCACGCTCCAATGCCAGGGTGGCAGGGCTCGGCTG 
GTWRLLCSSRSNARVAGLGC 

TGAGGAGATGGGCTTTCTCAGGGCTCTGGCGCACTCGGAGCTGGATGTCCGCACTGCGGG 
EEMGFLRALAHS ELDVRTAG 

CGCCAACGGCACATCGGGCTTCTTTTGCGTGGACGAGGGCGGACTCCGTCTGGCTCAGAG 
ANGTSGPPCVDEGGLRLAQR 

GTTGCTGGATGTCATCTCTGTATGTGACTGTCCTAGAGGCCGATTCCTGACTGCCACCTG 
LLDVISVCDCPRGRFLTATC 

CCAAGACTGTGGCCGCAGGAAGCTGCCGGTGGACCGCATTGTGGGGGGCCAGGACAGCAG 
Q DCGRRKLPVDRIVGGQDS S 

TCTGGGAAGGTGGCCGTGGCAGGTCAGCCTGCGTTATGATGGGACCCACCTCTGTGGGGG 
LGRWPWQVSLRYDGTHLCGG 

GTCCCTGCTGTCTGGGGACTGGGTGCTGACTGCTGCACATTGCTTTCCAGAGCGGAACCG 
SLLSGDWVLTAAHCFPERNR 

GGTCCTGTCTCGGTGGCGAGTATTTGCTGGTGCTGTAGCCCGGACCTCACCCCATGCTGT 
VLSRWRVFA GAVARTSPHAV 

GCAACTGGGGGTTCAGGCTGTGATCTATCATGGGGGCTACCTTCCCTTTCGAGACCCTAC 
QLGVQAV IYHGGYLPFRDPT 

TATCGACGAAAACAGCAATGACATTGCCTTGGTCCACCTCTCTAGCTCCCTGCCTCTCAC 
IDENSNDIALVHLSSSLPLT 

AGAATACATCCAGCCAGTGTGTCTCCCTGCTGCGGGACAGGCCCTGGTGGATGGCAAGGT 
EYIQPVCLPAAGQALVDGKV 

CTGTACTGTGACCGGCTGGGGTAACACACAGTTCTATGGCCAACAGGCTATGGTGCTCCA 
CTVTGWGNTQFYGQQAMVLQ 

AGAGGCCCGGGTTCCCATCATAAGCAACGAAGTTTGCAACAGCCCCGACTTCTACGGGAA 
EARVPI I SN EVCNS PDFYGN 

TCAGATCAAGCCCAAGATGTTCTGTGCTGGCTATCCTGAGGGTGGCATTGATGCGTGCCA 
QIKPKMFCAGYPEGGIDACQ 
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GGGCGACAGTGGAGGCCCCTTTGTGTGTGAAGACAGCATCTCTGGGACATCAAGGTGGCG 
GDSGGPFVCEDSISGTSRWR 

GCTATGTGGCATTGTAAGCTGGGGTACGGGCTGTGCTTTGGCCCGGAAGCCAGGAGTGTA 
LCGIVSWGTGCALARKPGVY 

CACCAAAGTCACTGACTTCCGGGAGTGGATCTTCAAGGCCATAAAGACTCACTCCGAAGC 
TKVTDFREWI FKAIKTHSEA 

CAGTGGCATGGTGACTCAGCCCTGATCCCGCCTCATCTCGCTGCTCCGTGCTGCACTAGC 
SGMVTQP* 

ATCCAGAGTCAGAGTTGGTCTGGTGGCTCCAGCCCCACGTGGTAGGCTCCACACTGGGCC 
TCACATGGAATGGTTTCCTGCTCAGATCCAGTCCACGGGTCCAAGGATGCTGGATCCAAG 
GACTTCTCTTCCACAGTGGCCGGCCCACTCAATCCCAGGGCCATTGGCCTCACCCTCCCA 
CCCCATGTAAATATTACTCTGTCCTCTGGGGGGCGCTCTAGGGAGCCCTTGTGCAGATGC 
TCTTTAAATAATAAAGGTGGTTTTGATTAAAAAAAAAAAAAAAAA 

Fi gure 2. Mouse hepsin cDNA and deduced amino acid sequences. 
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